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ABSTRACT 

We  present  an  all-sky  sample  of  « 1.4  million  active  galactic  nuclei  (AGNs)  meeting  a  two-color  infrared 
photometric  selection  criteria  for  AGNs  as  applied  to  sources  from  the  Wide-field  Infrared  Survey  Explorer  final 
catalog  release  (A11WISE).  We  assess  the  spatial  distribution  and  optical  properties  of  our  sample  and  find  that  the 
results  are  consistent  with  expectations  for  AGNs.  These  sources  have  a  mean  density  of^38  AGNs  per 
square  degree  on  the  sky,  and  their  apparent  magnitude  distribution  peaks  at  g  «  20,  extending  to  objects  as  faint  as 
g  &  26.  We  test  the  AGN  selection  criteria  against  a  large  sample  of  optically  identified  stars  and  determine  the 
“leakage”  (that  is,  the  probability  that  a  star  detected  in  an  optical  survey  will  be  misidentified  as  a  quasi-stellar 
object  (QSO)  in  our  sample)  rate  to  be  <4.0  x  10-5.  We  conclude  that  our  sample  contains  almost  no  optically 
identified  stars  «0.041%),  making  this  sample  highly  promising  for  future  celestial  reference  frame  work  as  it 
significantly  increases  the  number  of  all-sky,  compact  extragalactic  objects.  We  further  compare  our  sample  to 
catalogs  of  known  AGNs/QSOs  and  find  a  completeness  value  of  >84%  (that  is,  the  probability  of  correctly 
identifying  a  known  AGN/QSO  is  at  least  84%)  for  AGNs  brighter  than  a  limiting  magnitude  of  R<  19.  Our 
sample  includes  approximately  1.1  million  previously  uncataloged  AGNs. 
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1.  INTRODUCTION 

The  International  Celestial  Reference  Frame  (ICRF)  is  the 
realization,  at  radio  wavelengths,  of  the  International  Celestial 
Reference  System  (ICRS),  the  solar-barycentric,  quasi-inertial 
fundamental  reference  system  adopted  by  the  International 
Astronomical  Union  (Arias  et  al.  1995).  The  second  realization, 
ICRF2,  consists  of  3414  compact  radio  objects  (i.e.,  quasi- 
stellar  objects  (QSOs)),  of  which  295  are  “defining  sources” 
(Fey  et  al.  2015).  These  QSOs  can  be  nearly  ideal  reference 
frame  objects,  as  they  present  no  significant  parallax  or  proper 
motion  and,  when  properly  selected,  they  have  minimal  spatial 
structure  or  variability.  Using  Very  Fong  Baseline  Interfero¬ 
metry  (VFBI)  techniques,  ICRF2  defined  source  position  errors 
have  an  estimated  noise  floor  of  40  /ias.  Plans  for  the  third 
realization  of  the  ICRF  (ICRF3),  planned  for  2018,  focus  on 
increasing  the  density  of  the  source  catalog  and  improving  the 
spatial  uniformity  (especially  at  negative  declinations),  improv¬ 
ing  the  astrometric  accuracy  of  the  non-defining  sources  to 
bring  them  close  to  the  accuracy  of  the  defining  sources,  and 
extending  the  ICRF  to  higher  frequencies  to  reduce  the  effects 
of  source  structure  on  position  (Jacobs  &  ICRF3,  2014). 

Much  of  the  motivation  for  this  improvement  to  the  radio 
reference  frame  is  driven  by  the  European  Space  Agency’s 
(ESA’s)  Gaia  mission,  launched  in  2013.  Gaia  is  a  space- 
based,  astrometric,  photometric,  and  radial  velocity  all-sky 
survey  at  optical  wavelengths.  Over  the  next  few  years,  Gaia 
will  deliver  astrometric  catalogs  that  are  expected  to  be  adopted 
as  the  next  optical  representation  of  the  fundamental  reference 
frame.  Unlike  its  predecessor,  Hipparcos ,  which  was  limited  to 
observing  stars  only,  Gaia ,  with  its  limiting  magnitude  of 
V  «  20,  will  directly  observe  hundreds  of  thousands  of 
extragalactic  objects.  By  tying  together  the  positions  of 
reference  objects  (i.e.,  QSOs)  observed  in  both  the  radio  and 
optical,  the  Gaia  reference  frame  will  be  brought  into 


alignment  and  rotationally  stabilized  with  respect  to  the  radio 
reference  frame. 

One  critical  limitation  in  aligning  optical  and  radio  reference 
frames  is  the  problem  of  discrepancies  (or  “offsets”)  in  the 
position  between  radio  and  optical  measurements.  Offsets  can 
be  due  to  a  variety  of  underlying  physical  differences  between 
the  emission  mechanisms  of  active  galactic  nuclei  (AGNs)  in 
the  optical  and  radio.  First,  for  AGN-dominated  galaxies, 
optical  emission  is  thought  to  originate  from  the  compact 
accretion  disk  surrounding  the  supermassive  black  hole 
(SMBH),  while  radio  emission  can  be  either  compact  or 
extended,  depending  on  the  orientation  of  the  jet  with  respect  to 
the  observer.  Second,  for  non- AGN-dominated  galaxies,  an 
optical  centroid  can  be  shifted  relative  to  the  radio  position 
because  of  contamination  from  the  host  galaxy.  Depending  on 
the  distance  to  the  source,  the  optical  position  can  be 
significantly  different  from  the  position  of  a  compact  radio 
core.  Third,  variability  in  either  the  jet  or  the  accretion  disk  can 
cause  apparent  changes  in  the  overall  position  of  the  AGN  in 
the  optical  or  the  radio  over  time,  making  the  AGN  an 
unreliable  tie  source  for  use  in  defining  the  reference  frame. 

Position  offsets  as  a  function  of  wavelength  in  this  class  of 
sources  have  been  extensively  studied,  beginning  with  da  Silva 
Neto  et  al.  (2002)  for  the  correlation  between  offsets  and 
X-band  (8.4  GHz)  structure  in  ICRF  sources,  and  extending  to 
large  systematic  analyses  of  radio  sources  overlapping  with  the 
Sloan  Digital  Sky  Survey  (SDSS;  Orosz  &  Frey  2013).  Recent 
optical  observations  of  ICRF  sources  found  offsets  between  the 
optical  and  radio  positions  at  the  10  mas  level,  leaving  the 
authors  to  conclude  that  unknown  effects,  most  likely  having  to 
do  with  underlying  astrophysical  phenomena,  induce  this  real 
offset  between  radio  and  optical  positions.  Such  an  offset 
would  introduce  a  fundamental  limit  of  about  0.5  mas  to  the 
accuracy  of  the  alignment  between  Gaia  and  the  ICRF 
(Zacharias  &  Zacharias  2014),  and  would  be  exacerbated 
should  the  sources  be  time  varying  in  nature.  These  offsets  can 
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potentially  be  minimized  by  selecting  reference  frame  objects 
that  minimize  problematic  features,  such  as  photometric 
variability  or  optical/radio  structure. 

It  is  thus  crucial  to  identify  and  characterize  as  many  AGNs/ 
QSOs  as  possible  in  order  to  maximize  the  number  of  reference 
frame  tie  objects.  Until  recently,  the  number  of  known  QSOs 
was  quite  limited,  of  the  order  of  a  few  tens  of  thousands  over 
the  entire  sky.  The  SDSS-DR12  Quasar  Catalog  (DR12Q), 
using  data  from  the  B  ary  on  Oscillation  Spectroscopic  Survey 
(BOSS;  Dawson  et  al.  2013)  and  photometrically  selected  QSO 
candidates  from  SDSS,  contains  297,301  spectroscopically 
confirmed  QSOs  over  approximately  9200  deg2  («22%  of  the 
sky;  I.  Paris  2015,  private  communication).  Gaia  is  expected  to 
observe  approximately  half  a  million  QSOs  over  the  course  of 
its  mission  (Claeskens  et  al.  2006;  Mignard  2012).  A  large,  all¬ 
sky,  coherent  catalog  of  well-defined  zero-parallax,  zero-proper 
motion  sources  that  includes  but  also  extends  beyond  the 
network  of  ICRF  sources  would  permit  an  extensive  study  of 
all  of  the  physical  offsets  associated  with  radio  and  optical 
reference  frames,  object  positional  stability,  and  object 
variability,  using  currently  available  ground-based  resources, 
such  as  the  United  States  Naval  Observatory  (USNO)  Robotic 
Astrometric  Telescope  (URAT,  Zacharias  2005;  Zacharias 
et  al.  2015),  and  the  Panoramic  Survey  Telescope  and  Rapid 
Response  System  (Pan-STARRS,  Kaiser  et  al.  2010;  Morgan- 
son  et  al.  2014). 

In  order  to  identify  and  characterize  as  many  AGNs/QSOs 
uniformly  selected  across  the  sky  as  possible,  we  apply  a  two- 
color  AGN  selection  criterion  to  the  Wide-field  Infrared  Survey 
Explorer  (WISE;  Wright  et  al.  2010)  database.  We  describe  our 
methodology  in  the  following  sections,  and  discuss  the 
properties  of  the  resultant  sample  and  the  reliability  of  the 
selection  criteria  we  have  chosen  for  detecting  AGNs.  Our 
resultant  sample  contains?^  1.4  million  AGNs,  of  which  « 1.1 
million  are  previously  uncataloged  and  most  are  compact 
(subtending  The  goal  of  generating  this  sample  is  to 


provide  a  large  sample  of  point-like  extragalactic  objects  with 
minimal  stellar  contamination  for  the  purpose  of  supporting 
future  photometric,  astrometric,  and  variability  studies;  main¬ 
tenance  and  improvements  of  the  celestial  reference  frame;  and 
general  astrophysical  and  cosmological  uses.  This  paper  is 
outlined  as  follows.  In  Section  2,  we  review  the  WISE  mission 
and  the  A11WISE  source  catalog  and  its  properties.  In  Section  3, 
we  detail  the  AGN/QSO  selection  criteria  we  use.  Finally,  in 
Section  4,  we  discuss  the  resultant  sample,  as  well  as  some  of 
the  properties  of  the  sample  sources  derived  using  the  SDSS 
data  release  12  (SDSS-DR12,  Alam  et  al.  2015). 


2.  ALLWISE  CATALOG 

The  WISE  survey  is  an  all-sky  mid-IR  survey  at  3.4,  4.6,  12, 
and  22  pm  (Wl,  W2,  W3,  and  W4,  respectively)  conducted 
between  2010  January  7  and  August  6,  during  the  cryogenic 
mission  phase,  and  first  made  available  to  the  public  on  2011 
April  14.  WISE  has  angular  resolutions  of  671,  674,  675,  and 
1270  in  its  four  bands.  The  A11WISE  data  release,  which  we 
use  for  this  work,  incorporates  data  from  the  WISE  Full 
Cryogenic,  3-Band  Cryo,  and  NEOWISE  Post-Cryo  survey 
(Mainzer  et  al.  2014)  phases,  which  were  coadded  to  achieve  a 


The  increase  in  depth  is  primarily  due  to  the  additional  coverage  in  the  Wl 
and  W2  bands  during  the  Post-Cryo  survey  phase,  although  photometry  in  all 
four  bands  has  been  improved  due  in  part  to  better  background  estimation. 


depth  of  coverage  «0.4  mag  deeper  than  previous  data 
releases.1  A11WISE  contains  the  positions,  apparent  motions, 
magnitudes,  and  point-spread  function  (PSF)-profile  fit  infor¬ 
mation  for  almost  748  million  objects.  Astrometric  calibration 
of  sources  in  the  WISE  catalog  was  done  by  correlation  with 
bright  stars  from  the  2MASS  point  source  catalog,  and  the 
astrometric  accuracy  for  sources  in  the  A11WISE  release  was 
further  improved  by  taking  into  account  the  proper  motions  of 
these  reference  stars,  taken  from  the  fourth  USNO  CCD 
Astrograph  Catalog  (UCAC4,  Zacharias  et  al.  2013).  A 
comparison  with  ICRF  sources  shows  that  A11WISE  Catalog 
sources  in  the  brightness  range  of  8  <  Wl  <  12  mag  have 
positional  accuracies  to  within  50  mas,  and  half  of  these 
sources  have  positional  accuracies  to  within  20  mas.  For  more 
details  on  the  WISE  mission,  see  Wright  et  al.  (2010). 

3.  AGN  SELECTION 

There  are  numerous  mid-IR  color  selection  criteria  in  the 
literature,  such  as  the  Spitzer  two-color  criteria  in  Lacy  et  al. 
(2004),  Stem  et  al.  (2005),  Lacy  et  al.  (2007),  and  Donley  et  al. 
(2012),  the  WISE  two-color  criteria  in  Jarrett  et  al.  (2011),  the 
WISE  one-color  criteria  of  Stem  et  al.  (2012)  and  Assef  et  al. 
(2013),  and  the  WISE  two-color  criteria  of  Mateos  et  al.  (2012). 
All  of  these  color  criteria,  while  defined  using  different  AGN 
subsamples,  are  in  general  agreement  with  each  other  and  rely 
on  the  fact  that  AGNs  separate  cleanly  from  stars  and  star¬ 
forming  galaxies  in  mid-IR  color  space  (for  a  recent  empirical 
discussion  of  how  objects  differentiate  in  WISE  color  space,  see 
Nikutta  et  al.  2014).  The  reason  for  this  separation  is  that  (a) 
stars  have  nearly  blackbody  SEDs  with  flux  densities  dropping 
at  wavelengths  longer  than  a  few  microns,  and  (b)  while 
reprocessed  photons  from  dust  heating  around  star  formation 
peaks  around  a  few  tens  of  microns,  the  hard  radiation  field 
from  an  AGN  accretion  disk  heats  dust  in  the  surrounding  toms 
up  to  the  dust  sublimation  temperature  (1000-1500  K)  leading 
to  a  relatively  flat  power-law  spectmm  that  is  easily 
distinguishable  from  the  aforementioned  non- AGN  SEDs.  It 
is  important  to  note  that  because  the  mid-IR  is  insensitive  to 
extinction,  mid-IR  color  selection  can  pick  out  heavily 
obscured  or  even  Compton-thick  (AH  >  1024cm-2)  AGNs 
(Mateos  et  al.  2013)  that  are  optically  indistinguishable  from 
star-forming  galaxies  (see,  for  example,  Figure  1  in  Donley 
et  al.  2012),  especially  at  higher  AGN  luminosities  (Assef 
et  al.  2013;  Messias  et  al.  2014;  Stern  et  al.  2014).  This 
eliminates  optical  selection  effects  and  can  yield  much  larger 
and  more  statistically  complete  samples  of  AGNs. 

In  choosing  a  mid-IR  color  selection  criteria,  we  required 
that  the  criteria  be  defined  directly  from  WISE  data,  using  only 
WISE  data;  this  mles  out  the  criteria  of  Lacy  et  al.  (2004),  Stem 
et  al.  (2005),  Lacy  et  al.  (2007),  and  Donley  et  al.  (2012), 
which  apply  to  Spitzer  IRAC  data.  This  avoids  uncertainties 
inherent  in  transforming  Spitzer  magnitudes  into  WISE 
magnitudes.  This  criteria  also  rules  out  the  SDSS  -WISE  color 
criteria  of  Wu  et  al.  (2012),  which  would  limit  our  sample  to 
the  SDSS  footprint.  We  further  required  that  the  criteria  be 
defined  directly  from  a  clean,  highly  reliable  sample  of  AGNs 
and  QSOs  not  defined  empirically  from  WISE  data;  this  mles 
out  the  criteria  of  Jarrett  et  al.  (2011).  Finally,  we  required  that 
the  criteria  be  a  two-color  selection,  involving  W2-W3,  which 
excludes  the  Stem  et  al.  (2012)  and  Assef  et  al.  (2013)  criteria. 
This  is  because  we  do  not  restrict  our  sample  to  high  Galactic 
latitudes,  and  so  contamination  by  brown  dwarfs  may  occur  if 
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we  only  use  W1-W2.  We  do  not  know  a  priori  that  our  sources 
are  extragalactic,  and  some  brown  dwarfs  can  share  the  first 
color  axis  W1-W2  with  AGNs  due  to  methane  absorption  at 
3.3  fi m,  reducing  emission  in  the  Wl  band  (e.g.,  Noll 
et  al.  2000). 2  In  developing  an  all-sky  catalog  of  mid-infrared 
zero-proper  motion,  zero-parallax  objects,  it  is  essential  to 
minimize  brown  dwarf  contamination,  as  many  brown  dwarfs 
are  within  a  few  parsecs  and  therefore  have  very  high  proper 
motions,  as  high  as  ~8"yr-1  (Luhman  2014).  To  pick  out 
Spitzer  IRAC-selected  AGNs  down  to  a  limiting  magnitude  of 
W2  <  15,  the  WISE  one-color  AGN  selection  criteria  of  Stem 
et  al.  (2012)  and  Assef  et  al.  (2013)  are  both  very  reliable,  with 
95%  and  >90%  reliability,  respectively.  However,  these  color 
cuts  also  overlap  significantly  with  the  mid-IR  color  space 
occupied  by  brown  dwarfs.  For  example,  using  the  representa¬ 
tive  sample  of  the  A11WISE  catalog  described  in  Section  4.3, 
out  of  1056  sources  identified  as  AGNs  using  the  Stern  et  al. 
(2012)  criterion,  62  (5.9%)  also  qualify  as  brown  dwarfs 
according  to  the  combined  empirical  criterion  for  dwarfs  with 
spectral  type  ^T5  and  nearby  L  and  T-type  dwarfs  of 
Kirkpatrick  et  al.  (2011).  Similarly,  of  the  1892  sources 
identified  as  AGNs  using  the  “R90”  criterion  of  Assef  et  al. 
(2013),  59  (3.1%)  fall  within  the  combined  Kirkpatrick  et  al. 
(2011)  criterion. 

With  these  considerations,  we  chose  the  two-color  selection 
criterion  of  Mateos  et  al.  (2012),  who  use  the  Bright  Ultrahard 
XMM-Newton  Survey  (BUXS),  one  of  the  largest,  flux-limited 
samples  of  “ultra  hard”  (4.5-10  keV)  X-ray  selected  sources,  to 
define  a  WISE  AGN  selection  criterion.  This  reduces  bias 
against  heavily  absorbed  AGNs,  and  the  BUXS  is  comprised 
mostly  (56.2%)  of  Type  1  AGNs  (emission  line 
widths  >  1500  km  s1);  the  remainder  are  almost  entirely  Type 
2  AGNs.  It  is  well  established  that  the  mid-IR  luminosities  of 
AGNs  correlate  strongly  with  their  hard  X-ray  luminosities 
(Lutz  et  al.  2004;  Gandhi  et  al.  2009;  Ichikawa  et  al.  2012; 
Mateos  et  al.  2015;  Stem  2015;  Asmus  et  al.  2015;  see  also  the 
relation  in  Secrest  et  al.  2015),  as  Compton  up-scattering  of  UV 
photons  from  the  accretion  disk  into  the  X-ray  regime  is  largely 
proportional  to  the  accretion  disk  luminosity.  In  terms  of 
overlap  with  the  mid-IR  color  space  shared  with  brown  dwarfs, 
of  the  860  sources  in  our  representative  sample  of  the  A11WISE 
catalog  that  are  classified  as  AGNs  according  to  the  Mateos 
et  al.  (2012)  criterion,  only  7  (0.8%)  would  qualify  as  L  or  T 
dwarfs  according  to  the  combined  Kirkpatrick  et  al.  (2011) 
criteria. 

We  take  all  sources  from  the  A11WISE  catalog  following 
Equations  (3)  and  (4)  from  Mateos  et  al.  (2012)  and  we  require 
that  all  of  our  sources  have  S/N  ^  5  in  the  first  three  bands 
(wl,2,3snr>  =  5),  as  recommended  in  Mateos  et  al.  (2012); 
as  a  further  constraint,  we  limit  our  results  to  those  with 
cc_flags  =  “0000,”  meaning  that  the  sources  are  unaffected 
by  known  artifacts  such  as  diffraction  spikes,  persistence, 
halos,  or  optical  ghosts.  We  subdivided  this  query  into  sources 
above  6  ^  0°  and  source  below  8  <  0°  to  make  our  queries 
tractable.  We  concatenated  the  resultant  tables  using  topcat, 
version  4. 2-3. 3  For  the  remainder  of  this  paper,  we  refer  to 
AGNs  selected  in  the  manner  outlined  above  as  mid-IR  AGNs 
(MIR AGNs),  although  we  reiterate  that  there  are  other 
selection  criteria  for  MIRAGNs  in  the  literature. 


2  See  Kirkpatrick  et  al.  (2011),  Cushing  et  al.  (2011),  Mace  et  al.  (2013),  and 
Cushing  et  al.  (2014)  for  some  samples  of  brown  dwarfs  discovered  with  WISE. 

3  http://www.star.bris.ac.uk/ ~mbt/topcat/ 


4.  RESULTS 

4.1.  Sample  Properties 

Our  sample  consists  of  1,354,775  MIRAGNs  spanning  the 
full  sky.  In  Figure  1,  we  show  a  density  plot  of  MIRAGNs 
across  the  sky,  clearly  showing  that  our  AGN  selection  criteria 
is  effectively  selecting  objects  outside  of  the  Galaxy.  Our 
sample  is  also  relatively  uniform  across  the  sky.  In  Figure  2,  we 
show  the  source  density  of  MIRAGNs.  By  randomly  sampling 
106  1-deg2  areas  across  the  sky,  we  calculate  a  mean  source 
density  of  «38  deg-2,  with  10%  and  90%  thresholds  of  15 
deg-2  and  59  deg-2,  respectively.  We  note  that  the  reasons  for 
the  deviation  from  a  truly  uniform  distribution  centered  at 
N/4tt  sr  =  33  deg-2  are  the  over-density  of  sources  at  the 
ecliptic  poles  (due  to  deeper  WISE  coverage)  and  the  under¬ 
abundance  along  the  Galactic  plane  (due  to  source  confusion); 
both  of  these  effects  are  visible  in  Figure  1. 

Almost  none  of  the  MIRAGNs  in  our  sample  (<2%)  have 
PSF  profile  fits  x?ed  >  3.0  in  the  highest-resolution  Wl  band, 
indicating  that  the  vast  majority  of  our  sources  are  unresolved 
by  WISE  and  therefore  subtend  angular  scales  less  than  ~6".  In 
Figure  3,  we  show  SDSS  thumbnails  of  a  random  selection  of 
25  sources  in  our  sample,  demonstrating  that  the  majority  of 
MIRAGNs  in  our  sample  are  indeed  compact  at  optical 
wavelengths  as  well. 

4.2.  Optical  Properties 

In  order  to  characterize  the  optical  properties  of  our  sample, 
we  cross-matched  it  to  SDSS-DR12,  which  is  the  final  data 
release  of  SDSS-III  (Eisenstein  et  al.  2011), 4  within  a  radial 
tolerance  of  R  <  1 ",  obtaining  424,366  matches.  To  determine 
the  fraction  of  false  positive  position  matches  (that  is, 
incorrectly  correlating  an  object  in  our  sample  with  a  different 
SDSS  object  due  to  random  positional  agreement),  we 
performed  the  same  match  on  a  scrambled  version  of  our 
sample  coordinates,  determining  that  less  than  1%  of  our  cross¬ 
matches  are  false  positive  position  matches  between  the  two 
catalogs. 

With  our  list  of  cross-matches  between  our  sample  and 
SDSS-DR12,  we  queried  the  PhotoObjAll  table  to  obtain 
optical  photometric  measurements  of  our  sources,  requiring  the 
clean  photometric  quality  flag  be  equal  to  1,  and  obtained 
193,637  sources.  We  similarly  queried  the  SpecObj  All  table 
to  obtain  spectroscopic  redshift  information  for  our  sources  and 
obtained  39,981  sources.  The  following  describes  the  statistics 
of  the  sample  based  on  these  queries. 

Source  Extent.  In  Figure  4,  we  show  the  optical  extents  of 
sources  in  our  sample,  given  by  the  psffwhm_r  parameter. 
The  majority  are  not  extended,  with  a  mean  FWHM  of  1.2  ± 
0.2  arcsec,  comparable  to  the  seeing  limit  of  SDSS  (Stoughton 
et  al.  2002),  further  underlining  the  power  of  our  mid-IR 
selection  strategy  at  picking  out  unresolved  AGNs/QSOs. 

Distances.  In  Figure  5,  we  show  the  spectroscopic  redshifts 
of  MIRAGNs  in  our  sample,  which  extend  to  about  z  ~  3.  Note 
that  SDSS  spectra  come  from  targeted  observations,  so  this 
subsample  of  MIRAGNs  suffers  from  a  selection  bias  and 
should  not  be  considered  to  be  representative  of  the  physical 
distribution  of  redshifts  for  the  entire  sample. 

Magnitudes.  In  Figure  6,  we  show  the  distribution  of  Wl- 
and  g-band  magnitudes  for  our  sources.  In  the  mid-IR,  1%  of 


4  SDSS-DR12  covers  14,555  square  degrees,  or  about  35.3%  of  the  sky. 


3 


The  Astrophysical  Journal  Supplement  Series,  221:12  (lOpp),  2015  November 


Secrest  et  al. 


MIRAGNs 


Figure  1.  Density  plot,  Aitoff  projection  in  Galactic  coordinates,  of  our  full  sample  of  AGN/QSO  sources.  The  under-density  of  sources  along  the  Galactic  plane 
below  \b\  <  15°  is  due  to  the  AGN  color  criterion  effectively  excluding  stars  and  other  Galactic  sources,  combined  with  the  source  confusion  limit  of  WISE  (Wright 
et  al.  2010).  The  increased  number  density  at  the  ecliptic  poles  is  due  to  deeper  WISE  coverage. 


deg  2 

(a) 


Figure  2.  (a)  Normalized  histogram  of  MIRAGNs  per  deg  2.  (b)  Corresponding 
thresholds  of  15  deg-2  and  59  deg-2,  respectively. 

our  sources  have  Wl  [3.4//m]  magnitudes  less  than  13,  68% 
have  Wl  magnitudes  less  than  16,  and  100%  have  Wl 
magnitudes  less  than  19.  Optically,  2%  of  our  sources  have  g 
magnitudes  less  than  18,  33%  have  g  magnitudes  less  than  20, 
75%  have  g  magnitudes  less  than  22,  and  96%  have  g 
magnitudes  less  than  24. 

Note  that  the  g-band  magnitudes  extend  well  past  the  —20 
mag  Gaia  limit.  Extrapolating  the  number  of  SDSS  DR  12- 
matched  sources  with  g-band  magnitudes  of  less  than  20  to  the 
entire  sky,  we  predict  that  this  sample  contains  -1.8  x  105  of 
the  AGNs/QSOs  that  will  be  observed  by  Gaia. 

4.3.  Stellar  Contamination 

In  creating  a  sample  of  extragalactic  sources  without 
knowledge  of  their  redshifts,  it  is  vital  to  be  able  to  adequately 


Lulative  histogram.  The  mean  source  density  is  ^38  deg  2,  with  10%  and  90% 

avoid  contamination  by  stars,  especially  for  unresolved 
sources.  While  stars  and  AGNs  occupy  completely  different 
loci  in  mid-IR  color  space  due  to  their  fundamentally  different 
SEDs  (see,  for  example,  Figure  12  in  Wright  et  al.  2010),  we 
nonetheless  aim  to  quantify  any  possible  contamination  of  our 
sample  by  stars.  To  do  this,  we  calculate  the  number  of  stars  in 
our  sample  as  follows: 

V.s.AGN  =  V-/c  -/c,s  •/c.s.AGN-  (0 

where  N  is  the  size  of  the  A11WISE  catalog  (747,634,026),  fc  is 
the  fraction  of  the  A11WISE  catalog  that  has  clean  WISE 
photometry  as  per  Section  3  (for  the  rest  of  this  work,  “clean” 
WISE  photometry  means  this),  /c?s  is  the  fraction  of  those  clean 
sources  that  are  stars,  and  /C,S,AGN  is  the  fraction  of  stars  with 
clean  WISE  photometry  that  survive  the  AGN  cut  we  employ. 
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Figure  3.  Random  sample  of  25  sources  taken  from  our  A11WISE-SDSS  DR12  cross  match,  showing  the  typical  angular  extent  of  sources  in  our  sample.  No  cuts  or 
photometry  requirements  were  made. 


To  estimate /c,  we  constructed  a  representative  sample  of  the 
A11WISE  catalog  by  querying  the  A11WISE  catalog  with  a  list 
of  1  million  coordinates  randomly  sampled  across  the  celestial 
sphere,  returning  all  sources  that  were  within  a  radius  of 
R  <  10".  This  returned  389,424  unique  sources.  Of  these 
sources,  13,506  have  clean  WISE  photometry,  or  3.47%. 


Estimating  /c?s  can  be  done  by  utilizing  the  fact  that  stars 
cleanly  separate  from  extragalactic  objects  in  mid-IR  color 
space,  especially  in  W2-W3.  This  is  shown  more  explicitly  in 
Figure  7,  which  depicts  a  clearly  bimodal  distribution  of  stars 
and  extragalactic  objects  in  W2-W3.  We  select  as  stars  all  those 
objects  with  clean  WISE  photometry  and  W2-W3  <  2,  although 
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FWHM  (arcsec) 

Figure  4.  Distribution  of  r-band  PSF  FWHM  in  our  sample  cross-matched  with 
SDSS  DR12  PhotObjAll.  Nearly  all  193,637  sources  in  this  subsample  are 
unresolved,  as  we  would  expect. 


z 


Figure  5.  Distribution  of  spectroscopic  redshifts  of  our  sample  cross-matched 
with  SDSS  DR12  SpecObjAll. 

there  is  some  mixing  of  stars  and  extragalactic  objects  between 
W2-W3  «  1-2,  and  elliptical  galaxies  have  star-like  mid-IR 
colors,  so  this  selection  should  be  viewed  as  conservative. 
From  this,  we  estimate  that  approximately  55.6%  of  the  sources 
in  the  A11WISE  catalog  with  clean  photometry  are  stars. 

Estimating  the  value  of  /c,s,agn  can  be  done  by  constructing 
an  unambiguous  sample  of  stars  and  then  cross-matching  the 
sample  with  the  A11WISE  catalog.  We  created  such  a  sample  of 
stars  from  the  all-sky  PPMXL  catalog  (Roeser  et  al.  2010),  a 
catalog  of  ~900  million  sources  with  optical  photometric  and 
astrometric  information  from  USNO-Bl.O  and  2MASS,  which 
is  complete  down  to  approximately  V  «  20.  We  selected  all  of 
the  sources  in  the  PPMXL  catalog  with  ZLband  magnitudes  less 
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mag 


Figure  6.  Distribution  of  W 1-  (Vega)  and  g-band  (AB)  apparent  magnitudes. 


W2  -  W3  (mag) 

Figure  7.  Distribution  of  W2-W3  for  a  representative  sample  of  sources  from 
the  A11WISE  catalog,  randomly  sampled  across  the  celestial  sphere. 

than  12,  as  this  excludes  all  AGNs  and  most  extragalactic 
objects  in  the  PPMXL  catalog.5  We  further  required  that  the 
absolute  values  of  the  proper  motions  in  R.A.  and  decl.  be  less 
than  150masyr_1,  as  this  avoids  spurious  entries  in  the 
PPMXL  catalog.  We  then  cross-matched  this  sample  of  stars  to 
the  A11WISE  catalog  within  R  <  1",  returning  499,724  sources 
with  clean  WISE  photometry.  Of  these,  only  20  (0.0040%)  fall 
within  our  AGN  selection  criterion.6  To  estimate  the  number  of 


5  Asa  quality  control,  we  required  that  the  difference  in  B  magnitude  between 
the  first  and  second  epochs  of  USNO-Bl.O  be  less  than  0.5  mag. 

6  Because  of  the  brighter  magnitudes  of  our  star  sample,  saturation  effects  in 
the  corresponding  A11WISE  data  become  important.  To  avoid  stars  with 
spurious  A11WISE  magnitudes,  we  removed  any  with  W\  <  8,  W2  <  7,  and 
W3  <  3.8.  This  removed  23  stars  that  would  have  been  classified  as  AGNs 
according  to  our  criterion.  This  is  not  a  significant  effect  for  our  sample  of 
AGNs,  however,  as  it  affects  less  than  0.092%  of  our  sample. 
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sources  that  would  have  fallen  within  our  AGN  criterion  by 
chance  mismatch,  we  cross-matched  a  scrambled  version  of  our 
list  of  star  coordinates  to  the  A11WISE  catalog,  returning  3,626 
“matches”  within  R  <  l".  Of  these,  7  fall  within  our  AGN 
selection  criterion,  implying  that  many,  if  not  most  of  the  20 
stars  falling  within  our  AGN  criterion  are  actually  chance 
mismatches.  We  therefore  consider  0.0040%  to  be  an  upper 
limit  to  the  percentage  of  stars  with  WISE  colors  following  our 
criterion. 

Multiplying  these  fractions  together,  Ac?s,agn  ~  580.  Given 
our  conservative  estimates  described  above,  we  interpret  this  as 
the  upper  limit  of  the  total  number  of  AllWISE-observed  stars 
that  leak  into  our  sample.  With  1.4  x  106  sources  in  our 
MIRAGN  sample,  the  expected  contamination  of  the  total 
sample  by  stars  is  therefore  <0.041%.  We  conclude  that  nearly 
100%  of  our  sample  is  uncontaminated  by  stars.  We  define  this 
as  the  “reliability”  of  our  sample.  It  is  important  to  note  that  the 
types  of  stars  found  in  PPMXL,  an  optical  catalog,  are  by  no 
means  the  same  types  of  stars  found  in  A11WISE  and  many 
infrared-bright  stars,  such  as  L-  and  T-type  dwarfs,  are  under¬ 
represented  in  our  star  sample.  We  have  chosen  a  two-color 
MIRAGN  selection  criterion  partly  to  avoid  contamination  by 
brown  dwarfs,  but  we  emphasize  that  our  reliability  analysis 
pertains  specifically  to  optical  survey  work. 


4.4.  Completeness 

While  the  primary  objective  of  this  study  is  to  obtain  a 
reliable  sample  of  extragalactic  sources  using  mid-IR  color- 
selected  AGNs,  it  is  nonetheless  useful  to  explore  the  statistical 
completeness  of  our  sample.  To  do  this,  we  used  the  second 
release  of  the  Large  Quasar  Astrometric  Catalog  (LQAC-2; 
Souchay  et  al.  2012),  which  contains  187,504  quasars, 
including  radio- selected  quasars  from  the  ICRF2,  optically 
selected  quasars  from  SDSS,  and  infrared- selected  quasars 
from  2MASS,  and  thus  represents  a  robust  sample  of  quasars 
over  a  wide  range  of  wavelengths. 

After  cross-matching  with  A11WISE,  we  find  that  93,403 
quasars  from  LQAC-2  have  clean  detections.  The  majority  of 
non-detections  are  due  to  sources  in  LQAC-2  that  are  too  faint, 
having  R  >  19. 7  Of  the  61,377  sources  in  LQAC-2  brighter 
than  this  limit,  51,618  (84.10%)  have  clean  detections  with 
WISE.  Of  these,  46,928  are  MIRAGNs,  or  90.91%.  Broken 
down  by  wavelength-based  source  catalog  (see  Table  1  in 
Souchay  et  al.  2012), 

radio:  84.00%  (15,441/18,391); 

near-IR:  82.47%  (15,973/19,368); 

optical:8  90.91%  (46,927/51,617). 

With  a  magnitude-limited  sample,  MIRAGN  classification  is 
therefore  quite  complete,  even  for  AGNs  selected  from  radio 
surveys.  Finally,  we  note  that  of  the  3414  ICRF2  sources,  1364 
have  clean  WISE  detections  and  R  <  19.  Of  these,  1219 
(89.40%)  are  MIRAGNs. 
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166,033  quasars  in  LQAC-2  have  /?-band  magnitudes  in  the  catalog,  and 
104,656  (63.03%)  have  R  ^  19. 

8  The  difference  of  one  between  this  denominator  and  the  full  sample  is  due  to 
the  R  magnitudes  being  derived  from  complementary  USNO-Bl.O  (Monet 

et  al.  2003)  data,  so  that  one  source  in  LQAC-2  is  a  purely  radio-determined 
AGN  with  a  cross-identification  in  USNO-Bl.O:  most  sources  in  LQAC-2  have 
data  from  multiple  wavelengths. 


It  is  useful  to  compare  the  completeness  of  MIRAGNs  with 
the  number  of  quasars  expected  to  be  discovered  by  Gaia  down 
to  a  magnitude  of  V  =  20  (&5  x  105,  Mignard  2012).  To  do 
this,  we  performed  a  similar  analysis  using  the  DR12Q  catalog 
from  SDSS,  which  contains  297,301  quasars,  44,831  of  which 
have  clean  WISE  detections.  The  majority  of  non-detections  is 
again  due  to  a  limiting  magnitude  of  about  g  <  20.  Of  those 
clean  detections  with  g  <  20,  38,915  (86.8%)  are  MIRAGNs. 
Using  the  g  band  as  a  proxy  for  V ,  23,906/27,093  (88.2%)  of 
the  cleanly  detected  sources  with  g  <  20  are  MIRAGNs. 
Extrapolating  over  the  whole  sky  (the  BOSS  survey 
covers  ^9.2  x  103  deg2),  «8.3  x  104  MIRAGNs  in  our  sample 
with  g  <  20  are  outside  the  SDSS  footprint,  and  are  therefore 
expected  to  be  new. 

It  is  worth  discussing  the  reasons  why  «10%-20%  of  AGNs 
in  LQAC-2  and  DR12Q  within  our  magnitude  limit  are  not 
recovered  in  our  sample  of  MIRAGNs.  In  Figure  8,  we  show 
the  mid-IR  color  distribution  of  (a)  the  LQAC-2  catalog  and  (b) 
the  DR12Q  catalog.  Most  sources  that  do  not  meet  the  criteria 
are  bluer  in  their  W1-W2  colors,  which  is  due  to  two  effects. 
First,  lower  ratios  of  AGN/host  galaxy  luminosities  preclude 
inclusion  by  our  criteria.  For  example,  Mateos  et  al.  (2012) 
found  that  above  a  hard  X-ray  luminosity  of  L2_iokev  ^ 
1044ergs_1,  97.1%  of  type  1  AGNs  (emission  line  widths 
^1500  km  s-1)  and  76.5%  of  type  2  AGNs  (emission  line 
widths  <  1500kms_1)  fall  within  their  criteria.  Below 
£2-10  kev  <  1044ergs_1,  those  percentages  are  84.4%  and 
39.1%,  respectively,  and  contamination  by  the  host  galaxy 
becomes  more  evident.  At  lower  redshifts  (z  <  0.5),  a  larger 
fraction  of  AGNs  in  both  DR12Q  and,  especially,  LQAC-2 
have  bluer  W1-W2  colors  that  fall  outside  the  Mateos  et  al. 
(2012)  criteria  (Figure  9).  To  show  that  this  is  due  to  lower 
AGN/host  galaxy  luminosities,  we  calculated  the  rest-frame 
5  GHz  luminosities  of  AGNs  with  radio  data  in  LQAC-2  by 
assuming  power-law  radio  SEDs  of  the  form  fv  oc  v~ a,  and 
calculating  a  directly  from  the  catalog  2  and  5  GHz  spectral 
energy  densities.  On  average,  AGNs  in  LQAC-2  have  power- 
law  indices  of  a  =  0.3  ±  0.6,  in  line  with  the  flat- spectrum 
radio  SEDs  typically  seen  in  AGNs.  Using  a  to  correct  for 
redshift,  we  find  that  for  sources  below  a  redshift  of  z  <  0.5,  the 
mean  5  GHz  luminosity  is  L$GHz  =  2.5  x  1041  erg  s_1, 
while  for  sources  above  a  redshift  of  z  >  0.5  the  mean 
5  GHz  luminosity  is  L5GH z  =  7.9  x  1043  erg  s_1.  AGNs  at 
higher  redshift  in  LQAC-2  are  therefore  much  more  dominant, 
and  so  tend  to  manifest  as  MIRAGNs.9 

Second,  at  very  high  redshifts  (z  >  2),  the  SEDs  of  even  pure 
AGNs  begin  to  move  out  of  the  mid-IR  color-color  space 
defined  by  our  chosen  AGN  selection  criterion  (see  Figure  5  in 
Mateos  et  al.  2012).  This  effect  can  be  seen  in  Figure  9,  and 
affects  the  DR12Q  catalog  especially  due  to  the  higher  average 
redshift  (z  =  2.1)  of  AGNs  in  the  catalog. 

If  we  exclude  sources  in  LQAC-2  with  z  >  2  and  R  ^  19, 
93.0%  of  the  remaining  sources  are  MIRAGNs.  If  we  make  the 
same  redshift  cutoff  for  sources  in  DR12Q  with  g  ^  20,  98.2% 
of  the  remaining  sources  are  MIRAGNs.  We  note  that  many 
(32.7%  and  51.0%,  respectively)  of  the  high  redshift  sources 
excluded  in  our  sample  would  have  been  included  if  we  had 
used  the  one-color  criteria  (W1-W2  >  0.8)  of  Stem  et  al. 
(2012).  The  AGN  criteria  outlined  in  Section  3  thus  minimizes 


Although  more  luminous  AGNs  may  also  be  preferentially  found  in  more 
luminous  host  galaxies.  See,  for  example  Hamilton  et  al.  (2008). 
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(a)  (b) 

Figure  8.  (a)  Mid-IR  color-color  plot  for  R  <  19  sources  in  LQAC-2,  and  (b)  for  DR12Q  with  g  <  20.  The  black  lines  are  the  Mateos  et  al.  (2012)  demarcation.  (See 
Section  4.4.) 


n  LQAC-2  MIRAGNs 
n LQAC-2  non-MIRAGNs 
nDR12Q  MIRAGNs 
-DR12Q  non-MIRAGNs 

z 


Figure  9.  Distribution  of  redshifts  for  AGNs  in  the  LQAC-2  and  DR12Q 
catalogs  meeting  the  WISE  photometry  requirements  outlined  in  Section  3,  and 
brighter  than  R  <  19  and  g  <  20,  respectively. 


leakage  of  stars  into  our  sample  at  the  cost  of  some  missed 
AGNs  (i.e.,  false  negatives,  or  “type  II  errors”). 

Finally,  DiPompeo  et  al.  (2015)  recently  published  a  catalog 
of  over  5.5  million  quasar  candidates  with  SDSS+WISE 
photometry.  Extending  the  XDQSOz  model  of  Bovy  et  al. 
(2012)  to  include  WISE  photometry,  and  using  a  training  set  of 
spectroscopically  confirmed  quasars,  they  assigned  quasar 
probabilities  Pqso  for  unresolved  sources  from  SDSS  DR8, 
as  well  as  computing  photometric  redshifts,  which  they  find  to 


be  significantly  improved  by  the  addition  of  WISE  photometry. 
We  cross-matched  our  sample  to  their  catalog  to  within  R  <  l", 
as  before,  finding  227,011  matches.  We  found  several  reasons 
for  non-matches,  which  we  outline  below. 

The  first  and  most  significant  reason  for  non-matches  with 
our  sample  is  the  limiting  magnitude.  Of  the  ~3.7  million 
sources  in  their  catalog  with  high-quality  SDSS  photometry 
(PSF  g  S/N^5),  only  ~9%  have  g  <  20. 10  The  second 
reason  for  non-matches  is  the  differing  photometry  require¬ 
ments  between  our  sample  and  the  XDQSOz  catalog.  Of  the 
197,635  sources  in  the  XDQSOz  catalog  with  g  <  20  not  in 
our  sample,  91,711  are  not  in  the  A11WISE  catalog  at  all, 
likely  due  to  the  independent  “forced  photometry”  performed 
on  WISE  data  for  the  XDQSOz.* 11  Of  the  sources  not  in  our 
sample  that  are  in  the  A11WISE  catalog,  only  5993  fulfill  our 
photometry  quality  requirements  as  outlined  in  Section  3.  For 
the  sources  in  XDQSOz  that  are  in  the  A11WISE  catalog 
with  clean  WISE  photometry  brighter  than  g  <  20,  130,420/ 
136,413  (95.6%)  are  in  our  sample.  Finally,  a  third  reason 
for  non-matches  is  again  the  redshift  limitation  of  our 
chosen  AGN  selection  criterion  of  z  ~  2.  If  we  retain  only 
sources  with  z<2,g<  20,  and  with  clean  WISE  photometry, 
then  110,756/112,138  (98.8%)  of  the  sources  in  XDQSOz 
are  in  our  sample.  We  note  that  relaxing  our  match  radius  to 
R  <  3"  does  not  significantly  alter  our  results,  yielding  a 
total  of  236,963  matches  between  our  sample  and  the 
XDQSOz  catalog,  with  98.4%  of  the  sources  in  XDQSOz 
with  z  <2,  g  <  20,  and  with  clean  WISE  photometry  in  our 
sample. 


10  These  are  Pogson  magnitudes  we  calculated  as  explained  in  https://www. 
sdss3.org  /  dr8  /  algorithms  /  magnitudes. php. 

11  Within  a  1"  radius;  relaxing  the  radius  to  3"  yields  an  additional  14,476 
candidate  sources  in  the  A11WISE  catalog. 
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Table  1 

MIRAGN  Data 


All  WISE 

R.A. 

decl. 

W1-W2 

W2-W3 

Wl 

§ 

z 

z-type 

J005947.42-01 1010.8 

14.947587 

-1.169694 

1.26 

2.74 

14.73 

18.82 

1.123 

s 

J005939.81-01 1445.7 

14.915884 

-1.246029 

0.73 

2.99 

14.81 

21.35 

0.457 

s 

J010016.47-020912.8 

15.068651 

-2.153577 

1.31 

3.57 

15.15 

2.009 

s 

J005858. 94-010459.4 

14.745614 

-1.083188 

1.01 

2.92 

15.12 

J005 847 .49-0 1 0549 .7 

14.697876 

-1.097150 

0.71 

2.76 

12.26 

0.047 

s 

J0102 10.49-015630.0 

15.543743 

-1.941685 

0.92 

2.38 

14.74 

0.6 

p 

J010323.03-012637.0 

15.846000 

-1.443626 

1.19 

3.19 

14.79 

J010032. 21-020046.0 

15.134236 

-2.012790 

1.11 

3.35 

12.86 

0.227 

s 

J005847.77-020808.6 

14.699062 

-2.135742 

1.27 

2.89 

15.48 

J005936.26-0 1 3003 .8 

14.901093 

-1.501077 

1.14 

3.51 

15.17 

19.46 

1.016 

s 

J0058 14.49-01 1507.0 

14.560381 

-1.251954 

0.99 

2.61 

14.39 

J010220.08-005743.5 

15.583695 

-0.962098 

1.31 

2.95 

15.07 

J005804. 98-015015. 7 

14.520752 

-1.837710 

0.88 

3.37 

13.84 

0.239 

s 

JO  10246.00-0 12600.5 

15.691669 

-1.433499 

1.38 

3.50 

15.31 

J010248. 37-021532.9 

15.701556 

-2.259148 

1.65 

3.77 

15.38 

J010304. 14-010040. 1 

15.767260 

-1.011162 

0.90 

2.89 

15.24 

J0101 14.58-021 141.1 

15.310782 

-2.194774 

1.02 

3.70 

16.04 

J010010.93-014909.5 

15.045565 

-1.819306 

1.05 

3.43 

15.66 

JO  1 0003 .46-0 1 5427 .7 

15.014454 

-1.907698 

1.58 

4.19 

17.25 

J010249. 02-010545.0 

15.704260 

-1.095849 

1.20 

2.68 

15.01 

19.89 

1.038 

s 

J005956. 19-010722.6 

14.984129 

-1.122969 

1.13 

3.74 

15.94 

J005957. 93-0053 1 1 .4 

14.991392 

-0.886508 

1.46 

3.37 

15.85 

J010129. 88-0105 15. 9 

15.374504 

-1.087752 

1.62 

3.22 

15.15 

J0101 04. 16-0059 18.6 

15.267358 

-0.988513 

1.40 

3.71 

15.84 

J005939. 26-014846.1 

14.913610 

-1.812812 

1.41 

2.99 

14.98 

Note.  “A11WISE”  is  the  “designation”  column  in  the  A11WISE  catalog;  R.A.  and  decl.  are  the  A11WISE  catalog  coordinates,  in  J2000.  g-band  magnitudes  come  from 
LQAC-2  where  available,  and  DR12Q  otherwise.  Redshifts  come  preferentially  from  LQ AC-2,  then  the  non-flagged  pipeline  redshifts  from  DR12Q,  then 
MILLIQUAS.  Photometric  redshifts  from  MILLIQUAS  are  flagged  in  z-type  as  “p.”  The  remaining  columns  are  the  unique  string  identifiers  found  in  the  matched 
catalogs,  but  have  been  omitted  to  save  space. 

(This  table  is  available  in  its  entirety  in  machine-readable  form.) 


4.5.  New  AGNs 

To  estimate  how  many  sources  in  our  sample  are  expected  to 
be  previously  uncataloged  AGNs,  we  cross-matched  our 
sample  to  the  Million  Quasars  (MILLIQUAS)  Catalog,  version 
4.5, 12  a  heterogenous  compilation  of  all  known  or  candidate 
AGNs/QSOs  through  2015  May,  which  contains  1,153,110 
entries.13  It  includes  quasar  data  from  the  NAS  A/IP  AC 
Extragalactic  Database  (NED),  SIMBAD,  and  SDSS-DR12Q. 
Cross-matching  our  source  list  to  the  MILLIQUAS  catalog  to 
within  R  <  1"  yields  202,203  sources;  however,  in  order  to 
more  completely  assess  the  number  of  sources  in  our  sample 
already  in  the  MILLIQUAS  catalog,  we  relaxed  our  cross¬ 
match  radius  to  R  <  10",  obtaining  210,534  matches.  To 
estimate  the  level  of  contamination  at  this  cross-match  radius 
by  random  matches,  we  cross-matched  a  scrambled  version  of 
our  sample  coordinates  with  MILLIQUAS,  obtaining  976 
matches  within  R  <  10",  a  contamination  level  of  0.46%. 
Expanding  our  match  radius  to  R  <  30"  only  produces  an 
additional  6216  matches,  and  an  unacceptable  level  of 
contamination  by  random  matches  of  4.0%.  Our  sample  of 
mid-IR  selected  AGNs  is  therefore  expected  to  contain 
approximately  1.1  million  uncataloged  AGNs.  We  give  an 

12  http://quasars.org/milliquas.htm 

13  MILLIQUAS  includes  the  Half  Million  Quasars  (HMQ)  Catalog 
(Flesch  2015),  but  the  latter  excludes  candidate  AGNs/QSOs.  Of  AGNs  in 
the  HMQ,  55.33%  of  sources  with  R  <  19  are  MIRAGNs,  suggesting  a 
prevalence  of  non-dominant,  low-redshift  sources  or  sources  at  high  redshift. 
Indeed,  by  further  excluding  sources  in  the  HMQ  with  z  <  0.5  and  z  >  2.0  (see 
Section  4.4),  79.19%  of  sources  in  the  HMQ  are  MIRAGNs. 


example  of  our  sample,  cross-matched  to  the  LQAC-2, 
DR12Q,  and  MILLIQUAS  catalogs  to  within  R  <  1",  in 
Table  1. 

5.  CONCLUSIONS 

We  have  explored  the  use  of  WISE- selected  AGNs  to  derive 
a  highly  reliable  sample  of  extragalactic  sources  for  astrometric 
purposes.  Our  primary  conclusions  are  as  follow. 

1.  Using  the  two-color  AGN  criteria  of  Mateos  et  al.  (2012) 
and  strict  photometric  quality  requirements,  we  derive  a 
sample  of  1,354,775  MIRAGNs  from  the  A11WISE 
source  catalog.  Approximately  1.1  million  of  these  were 
previously  uncataloged. 

2.  From  a  reliability  analysis  using  499,724  stars  from  the 
PPMXL  catalog,  we  estimate  that  the  fraction  of  stars  in 
our  sample  is  extremely  small,  ^0.041%,  and  conclude 
that  this  technique  is  extremely  reliable. 

3.  The  use  of  mid-IR  color  selection  results  in  a  high  level 
of  completeness,  and  we  estimate  that  our  sample 
contains  ^8.3  x  104  AGNs/QSOs  with  g-band  magni¬ 
tudes  below  <20  that  fall  outside  the  SDSS  footprint,  a 
significant  fraction  («17%)  of  the  number  of  QSOs 
expected  to  be  discovered  by  Gaia. 

In  a  subsequent  paper,  we  will  use  the  sample  derived  here  to 
look  for  optical  signatures  of  previously  undetected  AGNs 
using  multi-epoch  URAT  observations.  URAT  is  an  all-sky 
astrometric  survey  in  the  visible,  and  is  a  follow-up  project  to 
the  previous  UCAC  program.  Utilizing  the  red-lens  from  the 
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UCAC  program,  the  telescope  has  been  completely  redesigned. 
The  new  4-shooter  camera  consist  of  four  large  10,560  x 
10,560  pixel  CCDs,  with  a  combined  single  exposure  covering 
28  square  degrees  of  the  sky  at  a  resolution  of  0"9pix_1.  The 
newly  released  URAT1  catalog  (Zacharias  2015;  Zacharias 
et  al.  2015)  contains  accurate  positions  (typically  10-30  mas 
std.  error)  of  220  million  stars  in  the  3-18.5  mag  range,  mainly 
in  the  northern  hemisphere.  Proper  motions  have  been  obtained 
for  85%  of  these  stars  utilizing  the  Two  Micron  All  Sky  Survey 
(2MASS)  as  the  first  epoch.  UR  ATI  is  also  supplemented  by 
2MASS  and  AAVSO  Photometric  All-Sky  Survey  (APASS) 
photometry.  We  will  characterize  the  astrometric  and  photo¬ 
metric  variability  of  the  AGNs  we  detect  with  URAT,  we  will 
provide  an  optical  catalog  of  these  objects  and  their  derived 
properties,  and  we  will  explore  the  utility  of  the  sources 
identified  in  this  paper  for  future  ICRF  work. 

We  thank  the  anonymous  referee  for  their  very  thorough 
review  which  significantly  improved  the  clarity  of  this  work. 
We  also  thank  Daniel  Stem  (JPL/Caltech)  for  his  helpful 
discussion  on  this  project,  and  we  thank  Ciprian  Berghea 
(USNO)  for  his  help  with  Pan-STARRS. 
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